Fast Labeling and Transcription with the Speechalyzer Toolkit
Abstract
We describe a software tool named “Speechalyzer”, which is optimized for transcribing, labeling, and annotating large speech data sets. It is implemented in Java as a client-server framework and interfaces with software for speech recognition, speech synthesis, speech classification, and quality evaluation. Its main applications are processing training data for speech recognition and classification models, and running benchmark tests on speech-to-text, text-to-speech, and speech categorization systems.